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1. INTRODUCTION 


Problems In remote sensing generate large picture element (pixel) data sets 
from which It Is often necessary to extract a subset for more Intensive analy- 
sis. There Is a tendency, because of classical statistical thinking, to treat 
these subsets as Independent random samples, whereas In practice, there are 
usually Important geographical patterns to be allowed for. In this report, a 
simple method of geographical sampling Is proposed which accounts for a wide 
variety of existing patterns In the pixel data. 


2. SAMPLING OF RECTANGULAR REGIONS 


To estimate a parameter Qf a population which can be thought of as having a 
uniform spatial distribution* It may be necessary to modify the usual consid- 
erations of sampling theory. In particular, there may be correlations between 
spatial neighbors, thus violating the assumptions of Independence. 

There are several possible approaches to dealing with these assumptions. If 
It Is evident that or one Is willing to assume that correlations between 
neighbors In the variables of interest decrease rapidly with distance, then 
small random samples can be taken In the hope that, on the average, the sites 
sampled will be sufficiently separated. Alternatively, a systematic sample 
may be taken at sites that are Intentionally spaced as far apart as prac- 
tical. For example. If the region Is rectangular. It may be subldvlded Into 
Isomorphic subrectangles and a point sampled as near as possible to the center 
of each subrectangle. If the correlations over substantial distances fall to 
be negligible, other strategies are In order. 

If estimation Is to be linear, the x and y coordinates of each sample or power 
thereof may be used as covariates. Alternatively, explicit assumptions about 
the correlations between points may be made, and the estimation adjusted 
accordingly. 

Both of these approaches depend on fairly specific assumptions and can be 
quite complicated to carry out In practice. Accordingly, an alternative 
method is proposed herein allowing for spatial correlations. The method is 
very simple to use and depends on fairly weak assumptions. 


3. ALTERNATIVE METHOD 


Assume that there Is a need to estimate parameters of a large population dis- 
tributed uniformly over a rectangular region R. Let the parameter 

a k V x *y) dxd y 

where f a Is square- Integrable on R. If (x^, y-j) are the coordinates of any 
sample value, then that value Is an estimate of f a (xj, y^). Thus, f a expres- 
ses the spatial dependence of whatever Is being estimated. 

The problem of estimating parameter a from a sample becomes (In this formula- 
tion) a problem In quadrature (i.e., numerical Integration). A random sample 
then gives a Monte Carlo estimate of the Integral 



It Is well known that certain systematic choices of points (x^, y^) give 
better approximations to such an integral than nonsystematlc choices. The 
simplest such set of choices is derived by the midpoint rule. For n x m 
points (x^ , y^.), i ■ 1, •••, n; j ■ 1, m in the rectangle (a,b) x (c,d). 
Choose 

[ x i ■ V <»-•>♦•]” [ y t ■ ^sr <" - c > * '] 

The points chosen are the midpoints of a set of isomorphic rectangles sub- 
dividing the large rectangle. Then 



The error in this approximation goes down as 1/n^, l/m^ and depends on the 
second derivative of f. Because the weights given to sampled points are all 

A 

equal, the variance in a that does not depend on x and y is the same as for a 
random sample of size n x m. 
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It Is known from numerical analysis that If we waive the requirement that the 
systematic sample be evenly spaced and evenly weighted. It Is possible to make 
very large Improvements In the accuracy of a numerical Integration. Gauss- 
Jacobi (ref. 1) Integration specifies a set 

0 < Z 4 < 1 ; 1 ■ 1 , • • • • n 

* » n 


and a set 


0 



such that If 


x, ■ Z, in (b - •) * b ; 

then 

* ‘ £ £ VnV'V V 

This set Is much more economical of sample points than Is the midpoint rule. 
Because the W^ >n sample points are not all equal, the variance, due to factors 
other than x and y values, will be larger than that for the midpoint rule. 
Properties of the Gauss-Jacobl sampling scheme will be derived in the follow- 
ing section. 
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4. GAUSS-JACOBI QUADRATURE 


The Gauss-Jacobl quadrature Is developed In most standard works on numerical 
analysis or orthogonal polynomials [e.g. , Erdelyl (ref. 2)]. However, since 
this method Is apparently not well known among workers In statistical theory, 
some of the more elementary properties are derived below. 

Gauss-Oacobl quadrature on a rectangle Is simply the Cartesian product of 
quadrature on Intervals. Therefore, consider the problem of Integration on an 
Interval [0,1]. It Is well known thit square- Integrate functions can be 
approximated arbitrarily well In the L^-norm by polynomials. Let f e L^ be a 
polynomial of degree n and p, such that 

/J (f - P n ) 2 dx < e small 

Then from Schwartz's Inequality (ref. 1) 

fo f - ta ] 2 < 1 

Thus, approximating f by polynomials allows one to approximate the Integral of 
f by the Integrals of polynomials. 

An orthonormal basis for L^ functions Is given by the Legendre polynomials on 
[0,1]. This Is a sequence of polynomials Lg, Lp L 2 ,***, L fl such that L^ Is 

a polynomial of degree 1; Jg L* ■ 1; /g L^Lj » 0, 1* j; and L^(l) > 0 for all 
1. This set of properties uniquely characterizes the set Lf. Furthermore, 

It Is easy to see that Lg,»-*, L n form a basis for the polynomials of 
degree <n. A further fact: all roots of L^ are real, separated, and lie In 
(0,1). Observe that If L^ had j < 1 roots In (0,1), a polynomial of degree 
j,Pj with those roots could be specified; and, away from the roots. It must 
have either always the same or always the opposite sign to Lj. Thus, 

/g PjLj * 0. But pj- agLg + ajLj + ••• + ajLj for some constants a k , 

so / g PjLj • 0, contradiction. 


Supose It Is desired to use n points of quadrature to estimate f. Let Pz n „i 
by a polynomial of degree 2n - 1 approximating f polntwlse and In L^. Using 
formal division of polynomials, one can decompose 

p 2n-l * L n^t *1 + p n-l 


where 

d n -l and Pn-1 are polynomials of degree n - 1 
Then 


Jo p 2n-l m L n q n-1 * p n-l 

But the first term on the right Is zero, since q can be written as a combi na- 
tion of L^'s where 1 < n and Is orthogonal to both of these. Thus, 

Si P-^i • jJ P n .j and the problem Is rr-^ced to Integrating the lower degree 
polynomial. In order to accomplish this, note that at the points x^, 1 • 1, n 
In (0,1) such that L n (x^) ■ 0, Pg^t^) ■ P n- j ( x ^ ) - By standard considera- 
tions, p n .j Is uniquely determined by Its values at the n points x^; there- 
fore, so Is Its Integral. The process of fitting a polynomial to the 
pairs Cx r P2n-i^ x l^ an< * * nt « 9 r «t 1 ng the polynomial Is readily seen to deter- 
mine a set of weights W fl , depending only on n such that 

>0 W 1 P2n-l tx l > 

The sets (x^>, (W^ > are tabulated and specify the Gauss-Jacobl quadrature 
of order n (ref. 1). It 1$ as accurate as Interpolating a polynomial of order 
2n - 1 In f(x) and Integrating the result. By contrast, the midpoint rule Is 
equivalent to Interpolating a piecewise linear function and Is, therefore, 
prone to much higher error for funct >n f exhibiting global regularity. 
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5. TEST OF THE QUADRATURE APPROACH 


To tost tht relevance of the quadrature approach, 10 Transition Year (TY) seg- 
ments were studied. These segments are rectangles, 117 pixels by 196 pixels 
In dimension. It was hoped to estimate the area proportion of small grains In 
each segment. The function f Is then the density of small -grain cultivation 
near a point. This was approximated by setting f equal to one for an entire 
pixel If the pixel were primarily small grains and to zero If otherwise. 
Ground-truth Information was used to obtain this value. For comparison, a 
6 by 10 grid was used. The grid averaged values over the 60 pixels which were 
numbered an odd number of tens (10, 30, 50, 70) In each dimension. This reg- 
ular grid was the one used In Procedure 1 as the primary type-2 grid. The 
correspondl ng 6 by 10 Gauss-Jacobl grid was that set of pixels nearest the 
n ■ 6 Gauss points In the first dimension and nearest the m ■ 10 Gauss points 
In the second dimension. These pixels and weights are given In table 5-1. 

The estimated proportions of small grains were then compared to "he true 
proportions obtained from the ground-truth studies. These results are given 
In table 5-2. The mean squared error for the Gauss-Jacobl procedure was sub- 
stantially lower than that for the type-2 grid, even though the difference 
was not significant according to an F-test. The Gauss-Jacobl procedure was 
essentially unbiased. It Is Interesting to note that treating each of the 
two estimates as a random sample (Ignoring geographical correlations) would 
predict that the Gauss-Jacobl estimate would average -4/3 as much variance as 
the regular grid because of unequal weights. 
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III. l.MIUlUpi 


TABLE 5-1.- THE PIXEL NUMBER AND WEIGHTS OF THE GAUSS-JACOBI GRID 4 
[Modified from Abramowltz and Stegun (rtf. 1)] 

First coordinate: 117 pixels (n ■ 6) to be sampled 


Points (pixel number) 

Weights 

4 

0.085662246 

20 

.1803807865 

45 

.233956967 

73 

.233956967 

98 

.1803807865 

114 

.085662246 


Second coordinate: 196 pixels (m * 10) to be sampled 


Points (pixel number) 

Weights 

3 

0.03333567215 

14 

.0747256746 

32 

.1095431813 

56 

.1346333597 

84 

.1477621124 

113 

.1477621124 

141 

.1346333597 

165 

.1095431813 

183 

.0747256746 

194 

.03333567215 


Estimate 



f (p]. pf) wj uf 




TABLE 5-2.- RESULTS OF A COMPARISON OF ESTIMATED PROPORTIONS OF 
SMALL GRAINS VERSUS TRUE PROPORTIONS OF SMALL GRAINS 
OBTAINED BY GROUND TRUTH 


H 

Ground truth, 
percent 

Gauss-Jacobi dots 
(small grains) 

Type 2 dots 
(small grains) 

Percent 

<3 

A 

Percent 

<3 

A 

1394 

35.45 

37.96 

2.51 

6.3 

32.14 

-3.31 

10.96 

1457 

47.72 

53.79 

6.07 

36.85 

28.81 

-18.91 

357.59 

1472 

37.95 

25.84 

-12.11 

146.65 

36.21 

-1.74 

3.03 

1518 

34.16 

40.82 

6.66 

44.36 

33.93 

-0.23 

0.05 

1584 

51.58 

49.03 

-2.55 

6.5 

48.33 

-3.25 

10.56 

1602 

30.42 

34.38 

3.96 

15.68 

23.33 

-7.09 

50.27 

1619 

47.91 

53.00 

5.09 

25.91 

38.33 

-9.58 

91.78 

1668 

9.49 

3.88 

-5.61 

31.47 

8.33 

-1.16 

1.35 

1909 

22.35 

22.10 

-0.25 

0.06 

33.33 

10.98 

120.56 

1918 

15.02 

12.67 

-2.35 

5.52 

12.28 

-2.74 

7.51 

Mean bias, 
percent 

Mean squared 
error 

Mean bias, 
percent 

Mean squared 
error 

-.14 

0.00319 

-3.7 

0.00654 
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6. CONCLUSION 


This experiment should be replicated with more segments in order to establish 
whether or not the observed improvement is due to chance. Further considera- 
tion should be given to what other properties of the segment can be estimated 
more efficiently when using Gauss-Jacobl sampling rather than using regular 
sampling. Some possibilities are yield (i.e., small-grains production on a 
more refined scale than 0 and 1) and various parameters of the spectral dis- 
tribution. Since these two possibilities are more nearly continuous than the 
mere presence or absence of grain, Gauss-Jacobl sampling may achieve rela- 
tively better results in these cases. 
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